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ABSTRACT 

We compute accurate redstart distributions to J^B = 24 and i?AB = 24.5 using photometric redshifts 
estimated from six-band UBVRIZ photometry in the Canada-France Deep Fields Photometric Red- 
shift Survey (CFDF-PRS). Our photometric redshift algorithm is calibrated using hundreds of CFRS 
spectroscopic redshifts in the same fields. The dispersion in redshift is cr/(l + z) < 0.04 to the CFRS 
depth of I^B = 22.5, rising to cr/(l + z) < 0.06 at our nominal magnitude and redshift limits of 
Ij^q = 24 and z < 1.3, respectively. We describe a new method to compute N(z) that incorporates 
the full redshift likelihood functions in a Bayesian iterative analysis and we demonstrate in exten- 
sive Monte Carlo simulations that it is superior to distributions calculated using simple maximum 
likelihood redshifts. The field-to-field differences in the redshift distributions, while not unexpected 
theoretically, are substantial even on 30' scales. We provide i^B ancl R-AB redshift distributions, 
median redshifts, and parametrized fits of our results in various magnitude ranges, accounting for 
both random and systematic errors in the analysis. 

Subject headings: Galaxies: Distances and Redshifts — Galaxies: Photometry — Methods: Statistical — 
Methods: Data Analysis 



1. INTRODUCTION 

The last decade has seen great advances in our un- 
derstanding of the evolution of the Universe, primar- 
ily due to ground breaking studies of the intermediate 
{Lilly et al.ll995b|) and high redshift (jSteidel et alll996ft 
Universe. These "pencil beam" surveys probed to cos- 
mological depths over quite modest areas, revealing the 
strong luminosity, morphological, and clustering evolu- 
tion of the galaxy population over the last 12 Gyr. It 
was, however, recognized at the time that the small win- 
dows on the Universe provided by these surveys were 
subject to substantial sample, or "cosmic" variance, in 
particular for measurements on the scale of the survey 
field sizes. 

Recently the 2dF JColless et alJ l200l and SDSS 
{York et al.l l2000) surveys provided the first precise "lo- 
cal" (z ~ 1) measurements of the galaxy luminos - 
ity function l|Norberg et aUl2002tt iBlanton et, all 2003L 
providing a crucial baseline reference for evolution- 
ary studies. In addition, measurements of the galaxy 
correlation function in th ese surveys {Norberg et al.l 
I2002al iZehavi et ail |2002|1 have convincingly demon- 



strated luminosity-dependent clustering long seen at 
lower significance in smaller samples. These surveys 
finally overcame the cosmic variance issues that had 
plagued previous local surveys by sampling over 10 3 
square degrees. 

With the advent of large format mosaic CCDs, we un- 
dertook the Canada-France Deep Fields (CFDF) survey, 
a deep UBVI imaging survey in four 30' x 30' fields. 
Published results from the CFDF include a study of 
the galaxy angular cor relation function to J^B = 25 
{McCracken et af] l2001). and a measurement of the clus- 
tering properties of Lyman-break galaxies at z ~ 3 
{Foucaud et al.ll2003|) . In this paper we introduce the 
CFDF Photometric Redshift Survey (CFDF-PRS), de- 
rived from a highly uniform UBVRIZ sub-sample of the 
main survey, which forms the basis of several evolution- 
ary measurements of the galaxy population as a function 
of redshift. Future papers (Brodwin et al, in prepara- 
tion) will present studies of the galaxy luminosity and 
correlation functions. 

In this paper we focus on an accurate measurement of 
the galaxy redshift distribution from < z < 1.3. We 
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introduce a novel iterative technique to extract the opti- 
mal redshift distribution using the full redshift likelihood 
functions for each galaxy. We present binned redshift 
distributions along with parametrized fits to our results. 

The outline of this paper is as follows. Section 
briefly describes the CFDF-PRS data. Section $3 de- 
scribes the calculation and calibration of the photomet- 
ric redshifts. Section ^introduces the iterative Bayesian 
method of recovering the redshift distribution using the 
full redshift probability function for each galaxy. In Sec- 
tions SjSJand Sj5]we present the photometric redshift distri- 
butions, obtained using the iterative method, as a func- 
tion of sample limiting magnitude in the CFDF down 
to Tab = 24 and -RaB = 24.5, both in half magnitude 
bins and via parametrized fits. Finally, in Section SJ7| we 
summarize our results. 

Most of our results are independent of cosmology. 
Where necessary we have assumed a concor dance cosmol- 
ogy i n agreement with the recent WMAP l)Snergel et al.l 
12003(1 results Q A } = {0.27, 0.73} and a Hubble con- 
stant h = 0.71 where Hq = 100 h. All distances are 
expressed in comoving Mpc. 

2. CANADA-FRANCE DEEP FIELDS 

The Canada-France Deep Fields (CFDF) is a 
deep, multi-color imaging survey coverin g 1 deg 2 
(Mc Cracken et all 120011 iFoucaud etaTl 12003. hereafter 
CFDF1 and CFDF2, respectively). The acquisition, re- 
duction, and verification of the astrometric and photo- 
metric integrity of the original UBVI data are described 
extensively in CFDF1. This first paper in the series also 
includes extensive completeness simulations showing that 
the survey is ~ 90% complete to i^B — 24. Deep -Bab 
and Jab § a l axv counts along with a detailed angular 
clustering analysis confirm the reliability of the data to 
at least this depth. 

2.1. Supplementary R and Z Imaging 

Photometric redshi ft simulations described elsewhere 
l|Brodwin et alJll999|) indicated the need for additional 
R- and Z-band imaging to produce accurate photomet- 
ric redshifts out to z ~ 1.3. Further deep CFHT imaging 
was obtained in these bands, along with some additional 
/ and B imaging to replace time originally lost to bad 
weather. We replaced the UH8k data with this supe- 
rior, deeper CFH12k I and B data, since combining the 
two datasets would have degraded the excellent CFH12k 
seeing and cosmetic quality and while adding little in 
terms of depth. These data were reduced, stacked and 
calibrated using standard procedures (e.g. bias subtrac- 
tion, flat fielding, fringe removal, geometric distortion 
correction), carefully adapt ed to mosaic C CD imaging 
as described in CFDF1 and lBrodwinl <|2004|) . 

New catalogu es for this work wer e generated using the 
X 2 technique of lSzalav et alJ l(1999l) . The various magni- 
tude limited galaxy samples studied in this paper were 
selected using total magnitudes as measured by the Sex- 
tractor l|Bertin and ArnoutsllT99fih MAG_AUTO param- 
eter, whereas the photometric redshifts are computed us- 
ing 2.5" diameter aperture magnitudes. 

Star/galaxy classification was performed using a hy- 
brid technique that incorporates both morphology (in 
0.6"Z-ba nd images) and 4 color-color planes of informa- 
tion (see iBrodwinl 1200^ for full details). Only objects 



Table 1. Final Photometric Sensitivities in the CFDF-PRS. 
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extremely reliably classified as stars are removed from 
the present analysis. 

2.2. Photometric Redshift Sample 

As explained in CFDF2, [/-band imaging only exists 
in 2.5 of our 4 30' x 30' fields. In addition, data in one of 
the eight UH8k CCDs with which the original data was 
taken is not science grade, due to bad charge transfer ef- 
ficiency. Limiting the survey to those regions containing 
6-band photometry, and excluding the regions masked 
due to bright (and saturated) stars, scattered light, rem- 
nant cosmetic defects that survived the median stacking, 
and the aforementioned unreliable CCD, resulted in a 
final CFDF-Photometric Redshift Survey (CFDF-PRS) 
area of 0.45 deg 2 . Although such conservative cuts re- 
move over half of the original survey area from the PRS, 
the remaining data is of very high quality and uniformity, 
and has the wavelength coverage necessary for robust 
photometric redshifts. The final photometric sensitivi- 
ties for the CFDF-PRS are presented in Tabled 

3. PHOTOMETRIC REDSHIFT ALGORITHM IN THE 
CFDF-PRS 

In order to estimate photometric redshifts and calcu- 
late the redshift probability distributions needed for the 
present study as well as the luminosity and clustering 
analyses of future CFDF papers (Brodwin et al. in prepa- 
ration), we employ a standard empirical template fit- 
ting algorithm, in which filter- convolved empirical galaxy 
template SEDs are fitted to the observed galaxy photom- 
etry to determine the best-fitting redshift and spectral 
type. To obtain robust galaxy colors the photometry in 
each filter is co-convolved to the worst seeing in each field 
(1.3"-1.4"). The photometry is fitted to the templates in 
the flux domain, with the flux error for each object taken 
as the quadrature sum of its Poisson noise and a Monte 
Carlo estimate of the sky error in the aperture. 
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The IColeman et all llT98nL here after CWW) empir- 
ical SEDs, supplemented by the iKinnev et alJ l)1996fi 
SB2 and SB3 starburst SEDs, were adopted as the ba- 
sis templates. This combination of templates has been 
shown to span the r ange of galaxy properties in the HDF 
from < z < 6 l)Bem'tezj 1200(1 Fernandez- Soto et alJ 
1999). The CWW templates were extended into the far 
UV using normalized so lar metallicity GISSEL models. 
The IKinnev et ahl l|1996|) starburst SEDs, well measured 
down to ~ 1100 A, were extended to 912 A via sim- 
ple linear extrapolation. Fifteen interpolates of these 
basis spectra were used to improve the redshift resolu- 
tion. Galaxy reddening is naturally incorporated into 
our empirical templates, as they are each composite spec- 
tra from numerous galaxies with varying levels of extinc- 
tion. Our SEDs are statistically corrected for intcrgalac- 
tic hydrogen abs orption as a function of redshift follow- 
ing M adaul l)1995j) . taking into account both Lyman series 
line blanketing at wavelengths below A b s < 1216 (1 + z) 
A and continuum photoelectric absorption below the Ly- 
man limit, A D bs < 912 (1 + z) A. A maximum likelihood 
method (ML) was used to produce the redshift likelihood 
surface for each object as a function of redshift and spec- 
tral type. 

This code was tested extensively with public HDF 
data and using the hundreds of CFRS spectroscopic 
redshifts in our fields. The latter sample allowed a 
novel form of calibration of the method, described be- 
low, to significantly improve the redshift accuracy. In 
the HDF, the simple ML redshifts matched the excellent 
result s in the literature dBenitezI |2Tj0(jyCojinonyetjdJ 
1 19971 iFernandez -Soto et alJ 12003: iFontana et all 1200(1 
ISawlckT^t^dTIT997fl . as shown in Figure ^ with redshift 
dispersions of a z ~ 0.07 (1 + z) over the full redshift 
range < z < 6 and only o ne "catastrophic error" (for a 
discussion of this object see lFernandez-Soto et alJ [20021. 
The photometric redshifts described in this Section have 
not been subjected to a Bayesian prior. In Section 0] we 
demonstrate how the use of such a prior improves the 
derived redshift distribution. 

1.5 1 i i i i i 
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Fig. 1. — Photometric redshift residuals for spectro- 
scopic galaxies in the HD F North for the sample defined in 
IFernandez-Soto et al.l <200l) . This Fi gure is directly comparable 
to their Figure 7. The dashed lines enclose the lcr error interval, 
which, excluding the single discrepant galaxy, is Az/fl + z) = 0.07. 



3.1. Calibration Using CFRS Spectroscopic Redshifts 

The accuracy of photometric redshift estimation can 
be improved using a spectroscopic training set (see, e.g., 
ICsabai et al"l l2"003). Taking the CFRS galaxies with the 
most secur e spectroscopic redsh ifts (confidence class 3 
and 4, see ILe Fevre et al] 119951 for details), we have a 
training set of 321 galaxies between < z < 1 and 
18.5 < I^B — 22.5 (where the bright limit is imposed 
by the saturation limit of the CFDF-PRS). We use this 
spectroscopic sample as a novel kind of calibrator to im- 
prove the photometric calibration in our three fields, 
thereby producing significantly more accurate redshift 
estimation. 

We examined the residuals between the observed mag- 
nitude and the best-fitting template at the correct (spec- 
troscopic) redshift in each band, as a function of the color 
of the galaxies. As an example, if all galaxies, regardless 
of redshift or spectral type, showed a residual, Am, in 
some band, that would suggest that there was a zero 
point offset of Am in that filter. If Am was correlated 
with the color of the objects, that could indicate an error 
in the filter effective wavelength (the origin of which is in- 
accurate filter transmission or QE curves), or problems 
with the template SEDs. In practice such color terms 
were negligible as expected, whereas small but signifi- 
cant zeropoint offsets were found in several filters. These 
offsets are caused, in part, by band-to-band seeing vari- 
ations not fully removed when we convolve our images 
in each filter to the same seeing. Such residual seeing 
variations would manifest as zeropoint offsets. Almost 
all calculated offsets were less than 0.1 mag, the largest 
being 0.12 mag. An example is shown in Figure[5]for the 
R filter in the 03hr field. 

This optimal calibration produces photometric red- 
shifts with comparable error properties in all three 
CFDF-PRS fields, as shown in Figure The redshift 
dispersion, in the common measure in which the error 
increases as (1 + z) is cr 2 /(l + z) = 0.042 for galaxies 
within the iteratively clipped 3 a limits of the distribu- 
tion (solid circles) . The direct dispersion for this sample, 
which includes over 92% of the galaxies is a z — 0.062. 

It is important to stress that no bias was introduced 
to the CFDF-PRS through the use of the CFRS spec- 
troscopy. The true redshifts were used to transform the 
CFRS galaxies present in the CFDF-PRS into photo- 
metric standards, which were in turn used to tweak our 
zeropoints. Given the spectroscopic redshifts, the pro- 
cess is completely internal to the CFDF-PRS photome- 
try, and results in optimal photometric redshifts for our 
depth and wavelength coverage. 

3.2. Photometric Redshift Accuracy for 1 ^ > 22.5 
Galaxies 

In order to test the accuracy of photometric redshifts 
at magnitudes fainter than the CFRS data, we added a 
scaled version of each image in each field back to itself, 
with a positional offset larger than the full CFRS field. 
Care was taken to ensure that the noise properties of the 
resulting images were identical to those of the original 
images. Specifically, in each field and filter we multiplied 
the original image by a scale factor, /i, translated this 
scaled image by ~ 10', and added it back to the original 
image. We then divided this new image by (1 + ) - 5 to 



4 



Brodwin et al. 



1.0 r 



0.5 



Zeropoint Offset = 0.078 



0.0 



-0.5 



-1.0 L 
o.o 



0.2 



0.4 



0.6 



0.8 



1.0 



1.2 



1.4 



R-I 



Fig. 2. — Residuals between the measured R magnitudes and the 
best-fitting template SEDs redshifted to the spectroscopic redshifts 
versus galaxy color. This technique essentially identifies the opti- 
mal photometric zero point and the effective wavelength in each 
broadband filter, using the CFRS galaxies as photometric stan- 
dards. In this 03hr R band plot, the zeropoint offset is ~ 0.08 
magnitudes. 
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Fig. 3. — Residuals in photometric vs. spectroscopic redshift 
for the CFRS spectroscopic galaxies in each field and for the full 
survey. For almost 93% of I AB < 22.5 CFRS galaxies that fall 
within the iteratively clipped 3<r limits of the distribution (solid 
circles) the redshift dispersion is <r ~ 0.06. In the commonly used 
cr/(l + z) measure the dispersion for this sample drops to ~ 0.04. 
Objects represented by open circles were rejected as outliers for 
calculation of the dispersion. 



restore the original image's noise properties. The scale 
factor, fx, is chosen such that in the final image CFRS 
galaxies with J^g = 22.5 are transformed into T^B = 24 
objects (i.e. a 1.5 magnitude change — > fx « 0.2595). 

These fainter CFRS galaxies were photometered, and 
photometric redshifts were computed as above. The re- 
sults, shown in Figure 01 demonstrate that while the in- 
creasing photometric error only modestly affects the red- 
shift dispersion in each field to I^B ~ 24, the fraction 
of outliers in our 3cr clipping algorithm increases to as 
much as ~ 15%. Clearly just using the maximum like- 
lihood values of the redshift is problematic in statistical 
analyses of these faint galaxies. 



While this exercise demonstrates the effect of increased 
photometric errors on redshift estimation, it implicitly 
assumes that i^B < 24 galaxies are basically just fainter 
copies of CFRS galaxies. It does not address the pos- 
sibility that at this fainter flux limit a new, distinct 
galaxy population may emerge, one which may be in- 
trinsically unlike our empirical galaxy templates. Photo- 
metric redshift accuracy for such galaxies would likely be 
quite poor, and the paper's main results could be sub- 
stantially compromised. However, no evidence for such a 
hypothetical population is seen in the public HDF, where 
the spectroscopic sample extends to 7^B ~ 25 and be- 
yond. Indeed the excellent photometric redshift accu- 
racy achieved by our code in this s ample indicates that 
the CWW and lKinnev et all l)199 6) templates provide a 
good description of galaxies to very faint magnitudes over 
< z < 6. We conclude that the effect on our primary 
results by any fundamentally different galaxy population 
which may emerge between 22.5 < Ij^q < 24 is minimal. 
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Fig. 4. — Residuals in photometric vs. spectroscopic redshift for 
galaxies to /ad < 24.0. The difference in imaging depths between 
the three CFDF-PRS fields is becoming apparent. In the 22hr field, 
the redshift dispersion and number of outliers is largely unchanged 
from the brighter sample in Figure [3] In contrast, the number of 
outliers in our 3<r clipping algorithm has doubled to 17% in the 
14hr field. The redshift dispersions, in each field and overall, are 
still less than a ~ 0.1 to Tab ~ 24.0. 



3.3. Photometric Redshift Accuracy for z ~ 3 Lyman 
Break Galaxies 

To test the photometric redshift algorithm at red- 
shifts much higher than the CFRS, we have used the 
ISteidel et alJ lj2003T) spectroscopic sample of z ~ 3 Ly- 
man Break galaxies (LBGs) present in our 14hr field 
(the "Groth strip"). We photometer the LBG positions 
in this field and compute redshifts for objects with S/N 
> 3. Figure shows the redshift residuals for the 104 
spectroscopic LBGs above this limit. This sample, be- 
ing extremely limited in redshift space, is instructive in 
exploring the occurrence of catastrophic redshift errors. 
The filled symbols are well measured redshifts, whereas 
the open symbols, representing ~ 13% of the sample, are 
catastrophic failures. Neglecting these the photometric 
redshifts at z ~ 3 have a dispersion ofcr/(l + z)~0.05. 

The catastrophic failures in this LBG sample stem 
from the difference in the U filters between the two sur- 
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veys. The lSteidel et alJ (|1996f) bluer U n filter allows se- 
lection of robust LBG candidates at slightly lower red- 
shifts than our U filter (in which the Lyman continuum 
break i s only centered at z ~ 3.2). The extremely faint 
z ~ 2.8 S teidel et alJ ((2Q03) LBGs, without a strong con- 
tinuum break fully bracketed by our filters, suffer a de- 
generacy between the z ~ 2.8 starburst and < z < 0.5 
redder galaxy types. Approximately 20% of the faint 
2.7 < z < 3.0 LBGs are misclassified as lower redshift 
galaxies. 

It is important to note that in this work, in addition 
to only using objects with S/N > 10, we are restricting 
ourselves to a redshift range which always contains the 
4000 A break. 
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Fig. 6. — Typical likelihood functions with la, 2a, and 3c con- 
fidence intervals for increasingly fainter galaxies. At the survey 
limit the likelihood functions become multiply-peaked, leading to 
disjoint, but nevertheless statistically valid confidence intervals. 



Table 2. Statistics of confidence intervals derived from redshift like- 
lihood FUNCTIONS FOR CFRS SPECTROSCOPIC GALAXIES. 



Confidence 


Observed 


Expected 


Interval 


Number 


Number 



F ig. 5. — Residuals i n photometric vs. spectroscopic redshift for 
104 St eidel et al.H 2003) Lyman Break galaxies with S/N > 3 in the 
CFDF-PRS 14hr field. Excluding catastrophic errors (open sym- 
bols), the dispersion in redshift is a/ (1 + z) ~ 0.05. The galaxies 
with catastrophic errors, roughly ~ 13% of the sample, are clus- 
tered below 2 ~ 3 where our filters do not fully straddle a strong 
continuum break. 



3.4. Statistical Validity of Redshift Likelihood 
Functions: Confidence Intervals 

In Section 21 we introduce a new method of calculat- 
ing N(z) in photometric redshift surveys using the full 
redshift likelihood function for each galaxy. Here the 
error properties of the likelihood functions are charac- 
terized using the CFRS spec troscopic sample. Following 
iFernandez-Soto et alJ <)1999D we define the 1,2, and 3-a 
confidence intervals as those redshift regions which en- 
close the top 68.3%, 95.4%, and 99.7% of the area in our 
normalized redshift likelihood functions. At this point 
the likelihood functions have not been subjected to any 
prior (i.e there is equal a priori likelihood at all redshifts). 

We found that convolving the likelihood functions with 
a Gaussian with a = 0.04 produced approximately nor- 
mal confidence level statistics, as shown in Tabled and 
has the beneficial effect of smoothing the likelihood func- 
tions. We note that the smoothing kernel is less than our 
redshift dispersion of ~ 0.06 measured with the CFRS 
spectroscopic sample. Examples of representative likeli- 
hood functions for Jaj3 — [21, 22, 23, 24] galaxies are 
shown in Figure 

Brighter than Ij^-q < 23.5 the likelihood functions are 
overwhelmingly single-peaked, approximately Gaussian 
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probability distributions, with relatively little probability 
density in secondary peaks. At the survey limit, however, 
the secondary peaks become increasingly significant, of- 
ten leading to multi-moded, disjoint confidence intervals 
even at the lcr level. 

The confidence level statistics are nevertheless quite 
consistent with a normal error distribution in the Pois- 
sonian limit, as shown in Table [3 although we place a 
slightly higher fraction of galaxies beyond the 3-er er- 
ror limit. The Poisson errors on the expected nu mber of 
galaxies are computed using the iGehrelsl l|1986fl expres- 
sions valid for small numbers of events. This exercise 
verifies the statistical validity of our redshift likelihood 
functions and of the analysis presented in Section ^51 that 
incorporates them. 

4. METHOD: CONSTRUCTING PHOTOMETRIC REDSHIFT 
DISTRIBUTIONS 

For high S/N photometry like that in the HDF, where 
the redshift likelihood functions are overwhelmingly nar- 
row and singly-peaked, it is sensible to simply adopt the 
redshift which maximizes the likelihood as the photomet- 
ric redshift. However, an open question is how to best 
measure photometric redshift distributions in the lower 
signal-to-noise photometry typical of ground-based sur- 
veys. The present method addresses this issue in the 
CFDF-PRS and in anticipation of upcoming wide-field 
near-HDF depth imaging surveys (such as the CFHT 
Legacy Survey). 
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Detailed Monte Carlo simulations (described below) 
demonstrate that simple maximum likelihood photomet- 
ric redshift methods increasingly fail to reproduce the 
underlying redshift distribution at S/N < 20. This Sec- 
tion describes a Bayesian method for recovering the un- 
derlying N(z) at still more modest S/N levels using the 
ensemble redshift information present in the data. 

4.1. Bayesian Photometric Redshift Distributions 

As described in detail in iBemted 1)2000(1 . Bayesian 
statistics provide a way to incorporate our knowledge 
about the galaxy distribution to improve the photomet- 
ric redshift accuracy. This is accomplished through the 
use of priors on the simple likelihood functions, which by 
nature assume all redshifts are a priori equally likely. The 
prior converts the likelihood functions into true Bayesian 
redshift probability distributions which can be incorpo- 
rated directly into cosmological studies in place of tradi- 
tional spectroscopic redshifts (which, in general, can be 
thought of as approximately (5-function redshift proba- 
bilities). 
Specifically, 

P(z,T) <xP[z,T\N(z)] x C(z,T) (1) 

where C(z, T) is the likelihood of measuring the observed 
colors for a galaxy of spectral type T at redshift z. In the 
Bayesian interpretation P[z, T\N(z)] is a prior, in this 
case the probability of drawing a galaxy with redshift 
z and type T from a redshift distribution N(z). This 
prior, applied to the likelihood function, incorporates the 
basic fact that all redshifts are not a priori equally likely. 
As the goal here is to measure a statistical property of 
the galaxy population, namely its redshift distribution, 
requiring this information a priori may appear somewhat 
circular. However, we have devised an iterative scheme 
to recover the underlying redshift distribution and tested 
it in detailed Monte Carlo simulations. 

4.2. Direct summation of N(z) and Iteration 

Rather than adopting the simple maximum likelihood 
(ML) value of each galaxy redshift, we choose instead to 
directly sum up the normalized likelihood functions, C n , 
of each galaxy in the photometric sample. This would 
trivially produce the correct redshift distribution in the 
limit of spectroscopic (i.e. (5-function) input redshifts. 
For photometric redshifts in the high signal-to-noise 
regime this method yields approximately the same result 
as collapsing the C n to the single maximum likelihood 
redshift since the C n are typically single sharp Gaussians. 
At fainter magnitude limits, the likelihood functions be- 
come broader and often develop multiple peaks as degen- 
eracies between distinct templates at different redshifts 
arise with the increasing photometric error. With sim- 
ple ML redshifts the derived N(z) is both too broad and 
has artificial peaks as significant numbers of galaxies are 
shifted between these degenerate redshifts. This is true 
as well of the first step of our iterative method: 

n°(z)=j2c?(z)=y: f 4tw • (2) 

i i Jo ^( z )dz 

This zeroth order redshift distribution, N (z), is then 
used as the prior, P[z, T\ N(z) ], in an iterative solu- 
tion to equation^ This prior uses the bulk information 



present in the galaxy colors to modify the raw likelihoods 
so as to minimize the degeneracies and, in effect, decon- 
volve the individual likelihood functions of the low S/N 
broadening effect. The resulting distribution serves as 
the first order prior, N (z), and the procedure can be 
carried on to convergence, which, as shown in the follow- 
ing Section, is so rapid that additional steps are unnec- 
essary. 

This method makes the assumption that a single prior 
is applicable to galaxies of all brightnesses and spectral 
types in a given sample. Ideally, each population of 
galaxies should have its own unique prior to optimally 
weight the simple likelihood functions. Although the use 
of a single statistical prior, in this case the iteratively 
estimated redshift distribution, is obviously a rough ap- 
proximation, we demonstrate that it works surprisingly 
well. This should not be mistaken to imply that redshifts 
of individual galaxies are uniformly improved with this 
method. It is the complete statistical redshift distribu- 
tion that is demonstrably improved compared with that 
obtained via the standard method of collapsing the red- 
shift likelihood function of each galaxy to a single maxi- 
mum likelihood redshift. 

Obviously the method breaks down at very low signal- 
to-noise. Extensive testing of the convergen ce properties 
of the method, presented in lBrod wiu ( 2004) and summa- 
rized in the Appendix, indicates that the convergence is 
excellent over the range 0.2 < z < 1.3 for S/N > 10. This 
is confirmed in the simulations described in the next Sec- 
tion. 

4.3. Test of the Method: Monte Carlo Simulation 

The iterative method described above was extensively 
tested in detailed Monte Carlo simulations. Briefly, 
model Universes were popula ted according t o CNOC2 
evolving luminosity functions l|Lin et al.lll999ft for early, 
intermediate and late spectral types, modified to pro- 
duce a range of reasonable input redshift distributions 
to z > 3. These model galaxies were assigned colors 
using the empirical template set used in the CFDF pho- 
tomet ric redshift code, ran ging fr om the CWW Ellip- 
tical (|Coleman et all 11980ft to the iKinnev et all l|199fifl 
SB2. Flux errors matching the UBVRIZ photometry 
in the CFDF fields were applied to these galaxies, and 
photometric redshift likelihood functions were computed 
for galaxies in an area of 0.5 deg 2 . A systematic er- 
ror not accounted for in this simulation is the possi- 
ble incompleteness of the adopted template SEDs with 
respect to the true range of galaxy properties. This 
w as approximately acco u nted for following the method 
of iFernandez-Soto et alJ l|1999ft by convolving the com- 
puted redshift likelihood functions with a variable-width 
Gaussian, 

/■oo 

C c (z)= dz 1 C{z') G[z\z',-£(l + z')], (3) 
Jo 

where G is a normalized Gaussian (truncated at z < 
0) of median z' and a = S(l + z 1 ). X was set to 0.05 
based on a detailed comparison with CFRS spectroscopic 
redshifts, as described in Section ^13.11 (in fact, this is 
~ 20% larger than was found for the CFRS, but was 
adopted in a conservative vein). 
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4.3.1. Simulation Results 

In the interest of clarity we briefly recall the various 
methodologies this simulation was designed to test. The 
basic technique for computing redshift distributions con- 
sists of making a simple histogram of the best-fitting 
photometric redshifts for each galaxy. This is identical 
to taking the redshift that maximizes the likelihood func- 
tion in likelihood-based fitting algorithms. We therefore 
dub this the Peak-ML method. Our new method, essen- 
tially a two-step iterative process, uses the full redshift 
likelihood function for each galaxy in computing N(z). 
We therefore label it the Full-RLF iterative method 
(sometimes referred to simply as the RLF method for 
brevity). Redshift distributions were computed using 
both the Peak-ML method and the Full-RLF iterative 
method, and compared to the input N(z). While the 
final iteration of the Full-RLF method is superior down 
to I^B — 24.0, the improvement is most pronounced for 
the I ays — 23-5 sample, which we plot in Figures and 
□ 

Figure [7| plots the redshift distributions obtained with 
the various methods. The filled and open circles repre- 
sent the input (model) and Peak-ML redshift distribu- 
tions, respectively. The dashed and solid lines show the 
Initial and Final iterations of the RLF method, respec- 
tively. 

At first glance, all the methods appear to reproduce 
the input N(z) quite well. However, upon closer exam- 
ination, it is clear that the Initial RLF N(z) is inferior 
to both the Peak-ML and the Final RLF distributions. 
On a qualitative level, we see that an incorrect secondary 
peak at z ~ 1.9 draws galaxies away from the main peak 
at z ~ 0.7. This problem is most acute for the Initial 
RLF method, but is nevertheless evident in the Peak- 
ML points. The Final RLF method appears to suffer less 
from the effect of this photometric redshift degeneracy. 

In fact, the trends observed with the different methods 
can be easily understood. Objects with little redshift 
ambiguity (i.e. narrow, singly-peaked redshift likelihood 
functions) contribute to the different N(z) measures in 
the same, correct way. On the other hand, those ob- 
jects whose redshift likelihood functions show a strong 
degeneracy (i.e. at least two similar peaks at distinct, 
aliased redshifts) contribute to N(z) in quite different 
ways in the above methods. In these cases the Peak-ML 
method usually selects the correct redshifts, although the 
catastrophic fraction increases with decreasing S /N. The 
main features of the input N(z) are therefore fairly well 
reproduced, although small peaks from the catastrophic 
errors will arise at specific aliased redshifts. The Initial 
RLF method, as the direct sum of the likelihood func- 
tions, will always underpredict the main N(z) peak as 
all the probability density of incorrect peaks at aliased 
redshifts is removed from it, leading to quite large peaks 
at the aliased redshifts. The Final RLF method, which 
uses the Initial RLF distribution as a prior, strongly sup- 
presses these incorrect peaks using the information con- 
tained in the bulk of the galaxy sample (which is, on the 
whole, correct), bringing most of the ambiguous redshifts 
into line. A quantitative analysis demonstrates the su- 
periority of the Final RLF method, and shows that it 
represents a significant improvement over the Peak-ML 
method. 



To quantitatively assess the different methods we com- 
pare the resulting N(z) of each method to the input N(z) 
using a x 2 test. To make a fair comparison, we integrate 
the iterative method curves over each Peak-ML redshift 
bin to produce a binned version of the iterative method 
for this test. The Initial RLF redshift distribution has a 
reduced xl = 14-5 when compared with the input N(z). 
The Peak-ML method, unsurprisingly, is much better, 
with xl — 4.02. It should be remembered that pho- 
tometric redshift errors are not formally Gaussian. In 
fact in the low S/N regime, where multiple peaks arise, 
the error distribution is patently non-Gaussian, render- 
ing impossible a direct confidence interval interpretation 
of the absolute % 2 values. However, regardless of the 
nature of the photometric error distribution, reductions 
in x 2 should indicate relatively better fits. In the Final 
RLF method the distribution no longer suffers from the 
effects of the redshift aliasing. The curve runs through 
(or very near) all the input N(z) data points, and the xl 
drops to 1.25. 

Figure |S1 shows the individual contribution from each 
redshift bin to the total value of xl f° r each method. 
The dramatic improvement from the Initial RLF (empty 
histogram) to Final RLF (densely filled histogram) is evi- 
dent. The resulting N(z) has smaller deviations from the 
input distribution in every bin compared with the Peak- 
ML redshift distribution (sparsely filled histogram). 

It is interesting to note that in this simulation the itera- 
tive method accurately recovers the redshift distribution 
outside the "guaranteed-convergence" redshift range of 
0-2 z S 1-3 (see Appendix). Various input redshift 
distributions and binnings have been investigated, with 
results consistent with those shown in Figure 

In addition to the x 2 tests Kolmogorov-Smirnov (K-S) 
statistics were computed for all methods. The same pat- 
tern emerges, in which the Peak-ML method produces a 
redshift distribution more consistent with the input dis- 
tribution than the Initial RLF method, but much less 
consistent than the Final RLF method. In this test the 
results are not impressive in either of the former two 
cases, with the somewhat better Peak-ML redshift dis- 
tribution matching the input distribution at a probability 
of only 10~ 3 . This is perhaps not surprising given the rel- 
atively high x 2 test result. On the other hand, the final 
iteration of the Full-RLF method has a K-S probability 
of 0.31 of matching the input distribution, which is the 
level of variation expected in a Poisson sampling of the 
underlying distribution. 

5. REDSHIFT DISTRIBUTIONS IN THE CFDF-PRS 

The iterative method was applied to the CFDF-PRS 
data described in Section At bright magnitudes our 
results can be compared directly wit h the CFRS redshift 
distributions ijCramoton et al]ll995t hereafter CFRS5). 

5.1. Comparison with the CFRS 

The three CFDF-PRS fields contain 3 of the 5 small 
fields in which the CFRS survey was carried out. We can 
therefore compare the redshift distributions of each field 
individually, as well as the aggregate results for the two 
surveys to the CFRS limit of J^B = 22.5. It is important 
to stress that these surveys are completely independent. 
The imaging data was taken with different cameras and 
filters, and the photometric calibrations are entirely inde- 
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Fig. 7. — Results of Monte Carlo simulations testing Peak— ML (open circles), Initial RLF (dashed curve) and Final RLF (solid curve) 
methods for computing N(z) in a photometric rcdshift survey. The filled circles show the input (model) N(z). The Peak-ML and Final 
RLF methods both recover the underlying N(z) quite faithfully, but the latter is in fact significantly better as discussed in the text and 
illustrated in Figure|S] Note how the aliased peak at z ~ 1.9 in the Initial RLF iteration is strongly suppressed in the Final RLF distribution, 
reproducing the model N(z) better even than the Peak— ML method. 



pendent. The CFRS magnitudes ar e isophotal. whereas 
in th e CFDF we use SExtractor ijBertin and Arnoutsl 
I1996H total magnitudes (MAG_AUTO). 

Figure El compares the 18.5 < Ij^-q < 22.5 redshift dis- 
tributions in each common field as well as for the sum 
of the fields. The solid curve is the Final RLF redshift 
distribution, the open circles represent the Peak-ML 
method, and the histogram is the original CFRS redshift 
distribution. The agreement in all three fields is remark- 
able, though not at all tautological. To this depth the 
CFDF-PRS contains over 20 times the number of CFRS 
galaxies in each field. Nevertheless the striking field-to- 
field differences in N(z) discussed in the CFRS persist in 
the present much larger survey. We have conducted tests 
similar to those in the CFRS ijCrampton et al.l Il995|) 
and conclude that these differences are not indicative 



of anisotropies beyond those attributable to small-scale 
galaxy correlations. This impressive agreement between 
the CFRS and the CFDF-PRS to J AB < 22.5 is a firm 
demonstration of the excellent control of systematic er- 
rors in the survey, as well as of the accuracy of the pho- 
tometric redshift code and the iterative N(z) algorithm. 

5.2. CFDF-PRS I AB < 24 Redshift Distribution 

Pushing beyond the CFRS limit, we present in Fig- 
ure EI] the redshift distribution to I^B — 24 for the full 
CFDF-PRS. Once again the curve is the final iteration 
of the Full-RLF method, and the Peak-ML distribution 
is represented by the open circles. As expected, the main 
difference between these methods in the 0.2 < z < 1.3 
convergence region (between the vertical dashed lines) 
is a relative paucity of objects in the Peak-ML method 
near the median redshift as these objects were scattered 
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Fig. 8. — Contribution to xi from each redshift bin for the Peak-ML (sparsely filled histogram), Initial RLF (empty histogram) and Final 
RLF (densely filled histogram) methods. Note the significant improvement over the Initial RLF method provided by both the Peak-ML 
and Final RLF distributions. The advantage of the Final RLF over the Peak-ML method is also apparent, as the differential xt °f t ne 
former is lower in each bin, often dramatically. The total xt (i- e - summing over redshift) for these samples is {Initial RLF, Peak-ML, 
Final RLF} = {14.5, 4.02, 1.25}. 



to one of the photometric redshift degeneracies at z ~ 1.5 
and z ~ 2.7. The Full-RLF partially corrects this effect, 
though a small fraction of objects remain clustered at 
these redshifts. 

For comparison we plot in the inset the e xpected z ~ 3 
Lyma n Break Galaxies to J^B — 24 from ISteidel et aD 
(1999), along with the number of LBGs from CFDF2 to 
this depth, both scaled to a one degree field siz e. As ex- 
plained in Section H3.3I the ISteidel et alJ l)1996[) filter set 
selects slightly lower redshift LBGs than in the present 
survey, leading to the small 2.5 < z < 3.0 peak of aliased 
redshifts. As we focus on the 0.2 < z < 1.3 convergence 
region in this work, we refer the reader to CFDF2 for a 
more detailed discussion of the low redshift interlopers in 
the Lyman Break sample in the CFDF. We simp l y note 
that the LBG densities for both the Stci del et alJ IjlT 



and CFDF2 samples lie roughly between the densities of 
the Peak-ML and Full-RLF methods, offering a measure 
of proof that the present results are reasonable even at 
z>1.3. 

The distinct characteristics of each field, as well as the 
evolution from the brighter 18.5 < I^B — 22.5 sample 
are discussed below. 

5.3. N(z) as a Function of Survey Depth 

Comparing Figures El and El we see several trends 
in the transition from the CFRS depth to the CFDF- 
PRS limit of ~ 24. As expected, the z > 1 sam- 
ple grows as a fraction of the general population, from 
~ 9% to ~ 30%, while the median redshift increases from 
(z) ~ 0.6 to (z) ~ 0.8. Despite the smoothing effects of 
photometric redshift errors, discrete structures in red- 
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Fig. 9. — CFDF-PRS and CFRS redshift distributions over the common magnitude range 18.5 < Tab < 22.5. The curves are the 
Full-RLF final iteration, the open circles represent the Peak-ML distributions, and the histograms are from the CFRS. In the individual 
fields the CFRS number densities are scaled to the CFDF values, whereas in the combined fields we plot the CFRS distribution using 
their original normalization. The fraction of unidentified CFRS objects is represented by the box to the left of the peak. Stars have been 
removed from both samples. 



shift space are still clearly seen. In this Section we give 
a brief description of the evolution of these structures in 
each field with survey depth. It is interesting to note 
that these field-to-ficld differences remain significant at 
the full survey depth, even on 30' scales. 

5.3.1. 03hr Field (RA: 03 h 03 m 00" DEC: +00°09'00"J 

The 03hr field has considerable structure at 0.6 < z < 
0.7, including a probable cluster near the middle of the 
field. This is confirmed in the CFRS spectroscopic cat- 
alog which contains at least two physical associations of 
galaxies, each with about 15 galaxies within dz < 0.025 
of systemic redshifts of z ~ 0.605 and z ~ 0.702. Inter- 
estingly, the CFRS did not cover the probable cluster (it 
being outside their survey area), but its presence strongly 
influenced the large scale structure they observed in this 
field. In the CFDF-PRS these two redshift space struc- 



tures influence the redshift distribution in this field down 
to Tab = 24, resulting in a relative paucity of galaxies at 
0-3 ^ z < 0.5, compared with the other fields and with 
the combined redshift distribution. 

5.3.2. 14.hr Field (RA: li h n m lQ s DEC: +52°24'43"j 

The 14hr field (Groth S trip) is one of the best stud- 
ied extragalactic fields (e. g. lGroth et alll9 94: Lill y et al] 
H995aHSteidel et al.l20 03). It is the most normal, in some 
sense, of our three fields. The picket fence structure ob- 
served in the CFRS has been smoothed out by the pho- 
tometric redshift errors, while the thirtyfold increase in 
area provided by the CFDF-PRS has reduced the cos- 
mic variance, leaving a relatively smooth, single-peaked, 
roughly Gaussian redshift distribution for z < 1.3. This 
distribution does not change appreciably in shape to 
J^b = 24, aside from steady increases in median red- 




Fig. 10. — CFDF-PRS redshift distribution over magnitude range 18.5 < Iab < 24.0. The solid cu rve and open circles represent the 
Final RLF and Peak-ML redshift distributions, respectively. Also plotted are the lSteidel et all 119991) and CFDF2 LBG counts (dashed 
error bar) integrated to Iab = 24 for comparison. The horizontal error bars represent the redshift sensitivity ranges of the respective LBG 
samples, whereas the vertical error bars show the 1— a errors in the integrated counts. 



shift and number density. However, at z ~ 1.8 we ob- 
serve a second peak arise in N(z) as we push to fainter 
magnitudes. Visual inspection of the galaxies involved 
indicates nothing unusual — they are entirely likely to 
be at the photometric redshift. However as this peak is 
outside our trusted redshift range of 0.2 < z < 1.3 we de- 
fer a detailed study of it until wide-field infrared imaging 
or spectroscopi c redshifts are ava ilable. 

The DEEP2 <|Davis et alJl2003|) redshift survey is tar- 
geting this field with the aim of obtaining 5000 redshifts 
to Iab < 24.5. As such, it will perform a valuable check 
on the results presented here. 

5.3.3. 22hr Field (RA-. 22 h l7 m 48 a DEC: +00° 17' 13"; 

The 22hr field is perhaps the most interesting of the 
three. There is a large overdensity at z ~ 0.3 observed in 
both the CFRS and the CFDF-PRS to the CFRS limit 
of Tab — 22.5. Between 22.5 < I^B — 24 the situation 



changes dramatically, and a broad peak at z ~ 0.8 (the 
median redshift at this depth) emerges. The shape of 
N(z) in this magnitude interval is quite similar to that 
in the 03hr field, albeit at slightly higher redshift. As 
the sample selection deepens from 18.5 < I^B — 22.5 to 
18.5 < Iab — 24 the relative sizes of the z ~ 0.3 and 
z ~ 0.8 peaks completely reverse. In addition, there is 
evidence for a substantial galaxy overdensity at z ~ 1. 

The VIRMOS-VLT deep spectroscopic survey (VVDS, 
iLe Fevre et alJ I2003D will obtain tens of thousands of 
spectroscopic redshifts to I^B < 22.5 over 3.6 deg 2 in 
this field. Given the relatively bright magnitude limit, it 
will be instructive to see whether this is sufficient area 
to obtain a redshift distribution unbiased by the z ~ 0.3 
structure seen in our data, and whether it will be deep 
enough to confirm the existence of considerable structure 
at z ~ 1. 
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5.4. Median Redshifts in the CFDF-PRS 

Several statistical studies including the extraction of 
cosmological parameters in weak lensing shear studies 
only require, to good approximation, the low-order mo- 
ment s of the galaxy population (e.g. iBernardeau et al"l 
119971) . Due to the sensitivity of the mean to occasional 
systematic photometric redshift errors, we instead com- 
pute median redshifts in various cumulative and differen- 
tial magnitude bins. Tomographic weak lensing studies 
can use such information to improve the measurement of 
cosmologi cal parameters by up to an order of magnitude 
fflulll999ft . 

While the median is a robust statistic, there are two 
sources of error which may nevertheless bias it in a 
photometric redshift survey such as CFDF-PRS. The 
first, cosmic variance, applies equally well to traditional 
spectroscopic surveys. It is a sort of random error, 
in the sense that it can be minimized by observing 
many fields. Following lEfstathiou et alJ (Il991fl fsee also 
iGroth and Peebleslll977l iSomervill" et al. 2003), we cal- 
culate the fractional variance in the counts due to small- 
scale clustering to be a 2 ~ 2%, or a m 15%. This agrees 
quite well with the maximum empirically measured field- 
to-field variations given in Tables El and H although the 
uncer tainties in the latter estimates are thems elves of or- 
der J 2/(n-l) = 100% for our n = 3 fields l)Pen et all 
2003) . The Poisson noise is negligible for samples as large 
as this and therefore, in the absence of systematics, the 
cosmic variance is expected to dominate the error bud- 
get. 

The second source of error, catastrophic failures of the 
photometric redshift method caused by redshift alias- 
ing in low S/N photometric data, is systematic in na- 
ture. The simulations of Section |4~3*1 indicate the redshift 
regime which is free of such aliasing (0.2 < z < 1.3), and 
we in fact use this convergence region in fitting our red- 
shift distributions in the next Section. However, catas- 
trophic errors do take place outside this convergence re- 
gion, as is clear from Figure [5] and we would like to 
investigate the effect of these errors on the median of the 
full sample. 

We make two estimates of the systematic error due 
to redshift aliasing. The first, and most conservative, 
adopts the worst-case scenario that all redshifts outside 
the convergence region are untrustworthy, which from 
Section t!3.3l we know is not the case. We compute the 
median omitting all objects outside the < z < 1.3 
convergence region, using two one-sided redshift cuts 
(i.e. z > 0.2 and z < 1.3) to estimate the asymmetric 
maximal systematic error possible in the survey. A more 
realistic estimate of the systematic error is estimated via 
simulations like those described in Section t!3.2l The 
CFDF-PRS I^B an d RaB median redshifts, along with 
all of the error estimates are listed in Tables [3| and re- 
spectively. We also provide in these Tables median red- 
shifts in differential (half) magnitude bins. Given these 
results, simple apparent magnitudes cuts in future imag- 
ing surveys can be used as proxies for photometric red- 
shift information in tomographic weak lensing studies. 

The complete error on each median redshift consists 
of the random error (a quadrature sum of the boot- 
strap and field-to-field errors) as well as the best es- 
timate of the asymmetric systematic errors. For the 



18.5 < Jab — 24 sample the median redshift is therefore 
(z) = 0.77 ± 0.049 (random) (systematic), where 

we use angle brackets to denote the median. We in- 
clude median redshifts for the 24.0 < T^B — 24.5 and 
24.5 < i?AB < 25.0 samples which are formally outside 
our trusted magnitude range for accurate photometric 
redshifts, as can be seen by the sudden rise in the system- 
atic error for these samples. Nevertheless, as the median 
is a reasonably robust statistic and of significant value 
to weak lensing studies, we tentatively include these two 
medians for use until deeper photometric redshift surveys 
become available. 

An important point to realize in interpreting Tables |21 
and 0] is that the errors on the medians would persist 
in an ideal spectroscopic survey. The random error esti- 
mates are not specific to the photometric redshift method 
and would be unchanged for a similar spectroscopic sam- 
ple. However, the spectroscopic redshift samples pub- 
lished to date do not approach the combination of field 
size, area and depth of the present survey, and have 
therefore been more affected by cosmic variance. The 
best-estimate systematic errors would remain non-zero 
for the faintest magnitude limits in a spectroscopic sur- 
vey, as single line redshift identifications are inherently 
problematic and lead to catastrop hic failures for as many 
as ~ 10% of the galaxies (e.g. see iFernandez- Soto et alJ 
2002). An additional systematic uncertainty, unique to 
spectroscopic surveys and quite difficult to estimate, is 
how spectroscopic incompleteness, most severe for galax- 
ies with weak or absent emission lines, affects the mea- 
sured N(z). 

As is clear from the Table, the field-to-field variance 
dominates the error budget in all cumulative (and most 
differential) magnitude ranges, and as such, the accuracy 
of the median redshifts in the CFDF-PRS is no worse 
than we had undertaken a more costly spectroscopic sur- 
vey. In fact, for the reasons mentioned above it may be 
superior. 

5.5. Comparison to COMBO-17, CNOC2 and RDF 
Median Redshifts 

In Figure lTTl we compare o ur i?-band median r edshifts 
to those of the COMBO-17 iBrown et al.ll200l photo- 
met ric redshift su r vey a nd the CNOC2 ijYee et alJl2000j) 
and ICohen et al.l lj2000|) spectroscopic redshift surveys. 
The median redshifts are plotted versus median i?AB 
magnitudes which are tabulated alongside the various 
sample magnitude ranges in Tabic 

Our results agree fairly well with, and are in fact inter- 
mediate between, the Combo-17 and lCohen et alJ (2000) 
results at the bright end, where the effect of cosmic vari- 
ance is most important. For the CNOC2 comparison we 
compute the median i? AB magnitudes and redshifts from 
the published catalog for the 0223+00 patch, as well as 
for three unpublished patches provided to us by H. Yee. 
To better match the CFDF-PRS, we have limited the 
CNOC2 sample to R AB > 18.5, used the same magni- 
tude ranges as in the present work, and estimated the 
error in the median from the field-to-field variance. Be- 
tween (i? AB ) - 20 and (R AB ) ~ 20.75, the CNOC2 limit, 
the agreement is excellent. The brightest CNOC2 point, 
at (R AB ) >^ 19.5 is at somewhat lower redshift than 
the trend implied by our data, but within the expected 
cosmic variance. 



Rcdshift Distributions in the CFDF-PRS 



Table 3. J A b Median Redshifts in the CFDF-PRS 



Random Errors 



Potential Systematic Effects 
due to Rcdshift Aliasing 









l-CT 


1-cr 


A bcst 


, A bcst s 


Amax 


Amax 


Mag Range 




<*> 


Bootstrap 


Field-to-Field 


(z = 3 -> 0) 


(z = -> 3) 


(z < 1.3) 


(z > 0.2) 


18.5 < Jab < 20.5 


19.88 


0.38 


0.006 


0.035 


-0.00 


0.00 


-0.01 


0.02 


18.5 < Jab < 21.0 


20.36 


0.43 


0.006 


0.065 


-0.00 


0.01 


-0.01 


0.03 


18.5 < Jab < 21.5 


20.79 


0.49 


0.007 


0.075 


-0.00 


0.01 


-0.02 


0.02 


18.5 < Jab < 22.0 


21.23 


0.54 


0.006 


0.070 


-0.00 


0.01 


-0.01 


0.02 


18.5 < Jab < 22.5 


21.68 


0.59 


0.004 


0.055 


-0.00 


0.01 


-0.02 


0.02 


18.5 < Jab < 23.0 


22.11 


0.64 


0.004 


0.031 


-0.00 


0.01 


-0.03 


0.02 


18.5 < J AB < 23.5 


22.59 


0.71 


0.005 


0.040 


-0.01 


0.01 


-0.06 


0.01 


18.5 < Jab < 24.0 


23.11 


0.77 


0.005 


0.049 


-0.01 


0.02 


-0.09 


0.02 



18.5 < Jab < 19.0 


18.76 


0.30 


0.010 


0.023 


-0.00 


0.01 


-0.01 


0.03 


19.0 < Jab < 19.5 


19.27 


0.32 


0.008 


0.010 


-0.00 


0.00 


-0.01 


0.02 


19.5 < Jab < 20.0 


19.79 


0.39 


0.011 


0.023 


-0.00 


0.01 


-0.01 


0.02 


20.0 < Jab < 20.5 


20.28 


0.43 


0.011 


0.085 


-0.00 


0.01 


-0.01 


0.03 


20.5 < Jab < 21.0 


20.79 


0.52 


0.010 


0.072 


-0.00 


0.00 


-0.01 


0.02 


21.0 < Jab < 21.5 


21.28 


0.58 


0.007 


0.050 


-0.00 


0.01 


-0.01 


0.01 


21.5 < Jab < 22.0 


21.78 


0.63 


0.007 


0.040 


-0.00 


0.01 


-0.02 


0.01 


22.0 < Jab < 22.5 


22.27 


0.71 


0.006 


0.036 


-0.01 


0.00 


-0.03 


0.01 


22.5 < Jab < 23.0 


22.77 


0.78 


0.007 


0.031 


-0.00 


0.01 


-0.06 


0.02 


23.0 < Jab < 23.5 


23.27 


0.89 


0.007 


0.031 


-0.01 


0.01 


-0.11 


0.02 


23.5 < Jab < 24.0 


23.78 


1.03 


0.008 


0.061 


-0.04 


0.02 


-0.23 


0.03 


24.0 < Jab < 24.5 1 


24.27 


1.26 


0.010 


0.083 


-0.13 


0.04 


-0.47 


0.06 



Note. — Errors of ±0.00 are smaller than the photometric rcdshift resolution of 0.01. 
This is formally outside the range trusted for accurate photometric redshifts (see text). 



Table 4. R AB Median Redshifts in the CFDF-PRS 



Potential Systematic Effects 









Random Errors 




due to Rcdshift Aliasing 










1-CT 


1-cr 


, A bcst 


, A bcst 


Amax 


Amax 


Mag Range 


(Rab) 


<*> 


Bootstrap 


Field-to-Ficld 


( 2 = 3^0) 


(z = ->■ 3) 


(2 < 1.3) 


(z > 0.2) 


18.5 < Rab < 20.5 


19.79 


0.34 


0.006 


0.012 


-0.00 


0.01 


-0.01 


0.02 


18.5 < Rab < 21.0 


20.26 


0.36 


0.005 


0.012 


-0.01 


0.00 


-0.01 


0.02 


18.5 < R AB < 21.5 


20.71 


0.38 


0.005 


0.032 


-0.00 


0.01 


-0.01 


0.03 


18.5 < Rab < 22.0 


21.16 


0.42 


0.005 


0.065 


-0.00 


0.01 


-0.01 


0.03 


18.5 < Rab < 22.5 


21.65 


0.47 


0.005 


0.075 


-0.01 


0.01 


-0.02 


0.02 


18.5 < Rab < 23.0 


22.12 


0.52 


0.005 


0.075 


-0.00 


0.01 


-0.02 


0.02 


18.5 < Rab < 23.5 


22.59 


0.57 


0.004 


0.065 


-0.00 


0.01 


-0.02 


0.02 


18.5 < Rab < 24.0 


23.09 


0.63 


0.005 


0.042 


-0.00 


0.02 


-0.04 


0.02 


18.5 < Rab < 24.5 


23.61 


0.71 


0.003 


0.030 


-0.01 


0.02 


-0.09 


0.02 



18.5 < Rab < 19.0 


18.78 


0.33 


0.011 


0.040 


-0.00 


0.01 


-0.02 


0.03 


19.0 < Rab < 19.5 


19.29 


0.32 


0.009 


0.023 


-0.00 


0.00 


-0.01 


0.02 


19.5 < Rab < 20.0 


19.78 


0.33 


0.009 


0.010 


-0.00 


0.00 


-0.01 


0.02 


20.0 < Rab < 20.5 


20.29 


0.37 


0.009 


0.021 


-0.00 


0.00 


-0.01 


0.02 


20.5 < Rab < 21.0 


20.78 


0.39 


0.009 


0.065 


-0.00 


0.01 


-0.01 


0.03 


21.0 < Rab < 21.5 


21.27 


0.44 


0.009 


0.068 


-0.00 


0.01 


-0.00 


0.03 


21.5 < Rab < 22.0 


21.78 


0.52 


0.008 


0.085 


-0.00 


0.01 


-0.01 


0.01 


22.0 < Rab < 22.5 


22.28 


0.55 


0.006 


0.076 


-0.00 


0.01 


-0.01 


0.02 


22.5 < Rab < 23.0 


22.78 


0.64 


0.006 


0.025 


-0.00 


0.00 


-0.02 


0.01 


23.0 < Rab < 23.5 


23.28 


0.73 


0.006 


0.023 


-0.01 


0.01 


-0.05 


0.01 


23.5 < Rab < 24.0 


23.78 


0.85 


0.007 


0.015 


-0.02 


0.01 


-0.12 


0.01 


24.0 < Rab < 24.5 


24.28 


1.01 


0.009 


0.040 


-0.04 


0.02 


-0.24 


0.04 


24.5 < R AB < 25.0 1 


24.76 


1.21 


0.010 


0.080 


-0.11 


0.03 


-0.42 


0.06 



This is formally outside the range trusted for accurate photometric redshifts (see text). 
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Fig. 11. — Median redshift vs. median -Rab magnitude in the 
CF DF-PRS (filled sym bols), alo ng with results f or the COMBO- 
17 IBrown e t al. 2003), CNOC2 lYee et alJI2000B and HDF North 
ICohen et al.H20001) surveys , adjusted to AB magnitudes. 



At fainter magnitudes these effects are minimized and 
the comparison becomes more straightfo rward. The 
agreem ent with both the COMB O-l 7 and ICohen et al.l 
(2000) surveys is excellent for all i?AB ~ 21 within the 
quoted errors. For consiste ncy, we transform t he pub- 
lished quartile limits for the ICohen et alJ l|2000|) sample 
into X—a errors ass uming Gauss i an di stributions. While 
we agree with the ICohen et al.l l|2000|) median redshifts 
within these conservatively large error limits, our fields 
clearly show a lower median redshift at faint magnitudes, 
quite in line with the COMBO-17 results. In fact, given 
the strong agreement our results suggest that for ensem- 
ble statistical measurements of the galaxy population, 
such as the two-point correlation function, it is perhaps 
advantageous to employ only ~ 6 broad-band filters 
(with spectral coverage from U through Z bands) and 
image more deeply rather than adopt a brighter medium 
band approach with ~ 15 filters. While the photometric 
redshift error on individual galaxies is likely lower with 
the medium band survey strategy, broad-band photo- 
metric redshift surveys like the CFDF-PRS probe deeper 
down the luminosity function with little penalty on sta- 
tistical measurements. 



6. 



REPRESENTATION OF THE CFDF-PRS REDSHIFT 
DISTRIBUTIONS 



In this Section we present our redshift distribu- 
tions binned in magnitude and redshift, as well as in 
parametrized form in various magnitude ranges. It 
should be noted that none of the main qualitative re- 
sults of this paper change if simple maximum likelihood 
redshift (Peak-ML) values are used in place of the Final 
RLF described in Section 21 

6.1. Binned Redshift Distributions 

In Tables|S]and[|)|we present the measured CFDF-PRS 
redshift distributions, binned so as to present trustwor- 
thy information over all redshifts. Specifically, we tab- 
ulate the Final RLF N{z) in redshift bins of Az = 0.1 
for z < 1.3 and much more broadly (Az = 1.0) at high 
redshift where the photometric redshifts are not as ac- 
curate. Our distributions are presented in 0.5 magni- 
tude bins, which should enable other workers to con- 
struct redshift distributions for any sample in the range 



18.5 < Jab — 24, with the obvious caveat that bright 
samples containing small numbers of objects have rela- 
tively larger uncertainties than deeper samples. 

As an example, in Figure 1121 we plot the binned 
distributions (filled symbols) for the CFRS depth of 
18.5 < 7 AB < 22.5 and the full CFDF-PRS depth of 
18.5 < i A g < 24 along with the full distributions (thin 
solid curves). The binned distributions accurately rep- 
resent the data at all redshifts, although they are highly 
smoothed at z > 1.25. As such our distributions should 
be used in this form for precision (e. g. cosmic shear) 
measurements . 

6.2. Parametrizations of N(z) 

For convenience we provide in this Section 
parametrized fits to our redshift distributions, adopting 
a generalization of the functional form introduced by 
lEfstathiou et all (j!99lTl : 



N(z) = N 



exp 



(4) 



These functions were fitted to our I A b and i?AB redshift 
distributions in various magnitude ranges and the results 
are listed in Tables and El respectively. We limit our 
fits to the redshift region 0.375 < z < 1.3, i. e. we exclude 
the z ~ 0.3 overdensity and fit the main single peak in 
each magnitude range. 

Our fitting technique consists of minimizing the 
Kolmogorov-Smirnov distance D between the cumula- 
tive distribution functions of the model and Final RLF 
method redshift distributions over the allowed redshift 
range. The full three parameter fits [zq, a, 0\ have a 
valu es very close to the com monly used value of 2.0 (see, 
e.g.. lEfstathiou et alJll99l . expected on geometric ar- 
guments for a flat average luminosity function. Without 
significantly compromising the accuracy of the fits, we 
therefore reduce the dimensionality of the analysis by 
setting a = 2.0. The X—a single parameter errors in zq 
and (3, listed in the Tables, are estimated via 100 boot- 
strap resamplings (with repetitions) of the CFDF-PRS 
catalog, and subsequent recalculation of the redshift dis- 
tributions using the Full-RLF method. The separately 
determined absolute number densities, Nq, along with 
bootstrap resampled errors are also listed in the Tables. 

We found Equation 01 to be a sensible model for N(z) 
up to the CFRS depth (I^B ~ 22.5), but at fainter mag- 
nitudes it fails to simultaneously model the main z < 1 
peak and the increasing high redshift tail (at I A B ~ 24 
the fraction of z > 1 galaxies is approximately 30%). We 
therefore limit its application to the z < 1.2 regime and 
fit the z > 1.2 tail of the distribution with a decaying 
exponential, 



N{z) = A T exp 



(5) 



Note that this equation is not fitted to the the Final 
RLF N(z) but rather to the binned distributions cal- 
culated from Tables and The transition between 
Equations 01 and is always very close toz« 1.2, but to 
produce a continuous parametrization should in general 
be taken to be the redshift at which the two parametriza- 
tions intersect. 



Redshift Distributions in the CFDF-PRS 

Table 5. Binned Tab CFDF-PRS Redshift Distributions 



15 



(^ AB )Am=0.5 
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z 


18.75 
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1 1 7Q 
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1 5^7 


9007 


2147 


9907 


9R08 


^9^ 


0.45 


— 0.55 


0.50 


79 


181 


638 


1042 


1718 


1885 


2787 


2657 


2961 


3224 


4384 


0.55 


- 0.65 


0.60 


48 


78 


387 


968 


2024 


2665 


3551 


3735 


4136 


4544 


5453 


0.65 


- 0.75 


0.70 


27 


29 


153 


401 


1229 


2142 


3355 


4519 


5260 


6073 


6573 


0.75 


- 0.85 


0.80 


10 


10 


44 


162 


562 


1419 


2743 


4264 


5526 


7029 


7506 


0.85 


- 0.95 


0.90 


3 


6 


20 


75 


238 


771 


1617 


2967 


4436 


6511 


7402 


0.95 


- 1.05 


1.00 


1 


3 


9 


22 


66 


224 


659 


1506 


2728 


4899 


6616 


1.05 


- 1.15 


1.10 


1 


2 


1 


7 


19 


62 


237 


670 


1448 


2999 


4870 


1.15 


- 1.25 


1.20 





1 


3 


5 


10 


32 


125 


377 


990 


2119 


3764 


1.25 


- 2.25 


1.75 





1 


2 


2 


5 


16 


62 


199 


512 


1183 


2373 


1.75 


- 2.75 


2.25 


6 


1 


5 


8 


8 


11 


40 


85 


204 


507 


1188 


2.25 


- 3.25 


2.75 


18 


7 


9 


15 


21 


19 


17 


70 


132 


292 


645 


2.75 


- 3.75 


3.25 


11 


3 


5 


8 


13 


10 


16 


21 


48 


116 


289 


3.25 


- 4.25 


3.75 


1 








1 





2 








3 


10 


51 
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- 4.75 


4.25 

















3 








1 


6 


25 



Note. — The redshift distributions are presented in constant 0.5 magnitude bins between 18.5 < T~AB — ^4, whereas the 
Az — 0.1 bin size in redshift is expanded to overlapping Az = 1.0 bins at z > 1.25. In each redshift bin the number of galaxies 
in the Am = 0.5 magnitude range is computed as TV = J^ h N(z,mi < Ia^B < mh)dz/ f* h dz, where mi — (^ab) — 0.25, 
mj, = (Ij^'q) + 0.25, and the number densities, N, represent the CFDF-PRS survey area of 0.4514 deg 2 . 



Table 6. Binned R A b CFDF-PRS Redshift Distributions 



(Rab) 



Am=0.5 





- z h 


z 


18.75 


19.25 


19.75 


20.25 


20.75 


21.25 


21.75 


22.25 


22.75 


23.25 


23.75 


24.25 


0.05 


- 0.15 


0.10 


128 


150 


183 


168 


318 


279 


306 


408 


429 


743 


1051 


1956 


0.15 


- 0.25 


0.20 


230 


303 


432 


443 


642 


782 


794 


1171 


1125 


1499 


1722 


2780 


0.25 


- 0.35 


0.30 


488 


706 
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1243 


1553 


1648 


2088 
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2279 


2475 


3240 
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- 0.45 


0.40 
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410 
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1913 
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2790 


3410 
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0.50 


92 
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222 


630 


1016 


1802 


2240 


3031 


3096 


3109 
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4204 
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- 0.65 


0.60 


13 


93 


114 


302 
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1587 


2755 


3632 


4382 


4292 


4591 
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0.65 
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0.70 


11 


32 


40 


63 


211 


558 


1615 


2618 


4373 


5370 


5893 
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0.75 


-0.85 


0.80 


2 


8 


9 


13 


43 


97 


502 


1339 


2882 


4900 


6216 


7061 


0.85 
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0.90 
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3 


3 


14 


22 


133 
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1624 
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6597 


0.95 
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1.00 





2 


1 


2 


8 


5 


10 
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3377 


5318 


1.05 
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1 
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80 
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1 
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11 
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1 


9 


27 
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19 
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84 
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20 


8 


10 


13 


20 


16 


27 


45 


68 


139 


297 
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2.75 


- 3.75 


3.25 


14 


7 


5 


7 


12 


9 


12 


15 


27 


11 
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341 
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1 


1 
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2 
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Note. — All quantities are as in defined in Table with the addition of the distribution for the 24.0 < -R^g < 24.5 bin in 
the final column. 



For the high redshift tail the dominant errors are likely 
systematic in nature, coming from the photometric red- 
shift algorithm operating in a redshift and magnitude 
regime where the convergence properties are highly un- 
certain. Without a large, deep spectroscopic compari- 
son sample at 1 < z < 3 it is difficult to assess these 
errors in a meaningful way. In the present work we 
conservatively assign a maximum uncertainty of 50% to 
the fit in the z > 1.3 regime. Examples of the fits 
are shown in Figure ^| for the 18.5 < I AB < 22.5 and 
18.5 < I AB < 24.0 samples. The solid, thick curve is the 



best fitting parametrization of the form of Equation 0] 
within the 0.375 < z < 1.3 fitting region, although it is 
likely correct at z < 0.375 (dotted curve) due to the con- 
straint at z = 0. At z > 1.2 the best-fit parametrization 
of the form of Equation [5] is represented by the long- 
dashed line. Taken together the full redshift range is 
well parametrized by these functions. 

7. SUMMARY 

We have introduced a new technique to compute N(z) 
in which the full photometric redshift likelihood functions 
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Fig. 12. — Binned distributions (solid symbols) and best-fit paramctrizations plotted along with the 18.5 < I ab < 22.5 and 18.5 < Iab < 
24 RLF redshift distributions (thin solid curves), scaled to 1 square degree. The solid thick line shows the best fitting parametrization 
of the main peak (Equation [IJ at z < 1.2 (represented as a dotted curve outside the fitting range, i. e. at z < 0.375). At z > 1.2 the 
long-dashed curve is the best fitting model of the high redshift tail f Equation 151. The CFRS histogram is included for reference as the 
(dotted) histogram, where once again the dotted square represents the unidentified fraction of CFRS objects. 



Table 7. Parametrized Fits of Tab CFDF-PRS Redshift Distributions 



Mag Range 






z 


< 1.2 








z > 1.2 




JV 






/3 




A T 


z T 


7 


18.5 < /ab 


< 24.0 


20785 


(133) 


0.790 


(0.008) 


2.788 


(0.059) 


11443 


1.786 


2.037 


19.0 < / A b 


< 24.0 


20735 


(141) 


0.793 


(0.009) 


2.805 


(0.067) 


11472 


1.783 


2.039 


19.5 < / AB 


< 24.0 


20625 


(143) 


0.800 


(0.008) 


2.852 


(0.058) 


11503 


1.780 


2.037 


20.0 < /ab 


< 24.0 


20310 


(135) 


0.818 


(0.009) 


2.969 


(0.074) 


11557 


1.776 


2.033 


20.5 < Iab 


< 24.0 


19800 


(126) 


0.840 


(0.008) 


3.123 


(0.073) 


11609 


1.773 


2.032 


21.0 < / AB 


< 24.0 


18861 


(141) 


0.872 


(0.007) 


3.341 


(0.074) 


11695 


1.770 


2.031 


21.5 < /ab 


< 24.0 


17569 


(128) 


0.902 


(0.007) 


3.529 


(0.093) 


11635 


1.782 


2.053 


22.0 < /ab 


< 24.0 


15573 


(120) 


0.945 


(0.008) 


3.817 


(0.121) 


11672 


1.776 


2.043 


22.5 < /ab 


< 24.0 


13046 


(111) 


0.978 


(0.009) 


3.904 


(0.135) 


11131 


1.788 


2.045 


23.0 < /ab 


< 24.0 


9829 


(90) 


1.015 


(0.008) 


3.992 


(0.122) 


9383 


1.837 


2.074 



18.5 < Iab < 21.5 


3216 


(169) 


0.609 


(0.010) 


3.200 


(0.096) 


41 


3.717 


93.92 


18.5 < Iab < 22.0 


5205 


(165) 


0.644 


(0.008) 


3.175 


(0.075) 


109 


3.460 


14.55 


18.5 < Iab < 22.5 


7759 


(186) 


0.683 


(0.009) 


3.119 


(0.068) 


658 


2.084 


2.327 


18.5 < Iab < 23.0 


10979 


(168) 


0.720 


(0.008) 


3.048 


(0.067) 


2422 


1.716 


1.994 


18.5 < Iab < 23.5 


15293 


(157) 


0.760 


(0.008) 


2.941 


(0.066) 


5889 


1.699 


2.013 


18.5 < Iab < 24.0 


20785 


(133) 


0.790 


(0.008) 


2.788 


(0.059) 


11443 


1.786 


2.037 



Note. — The fits are for the CFDF-PRS survey area of 0.4514 deg 2 . At z < 1.2 the best-fitting parameters 
of Equation 0] are listed, along with bootstrap-resampled errors. At z > 1.2, the parameters of fits of the form of 
Equation are tabulated. We conservatively estimate the error in the high redshift tail to be 50%. 



for each galaxy are incorporated to better reproduce the 
correct underlying redshift distribution. Direct summa- 
tion of the likelihoods produces the Bayesian prior which 
accounts for the fact that all redshifts are not a priori 
equally likely. No information external to the survey is 
used in the Bayesian technique, rather we iterate within 
our own dataset. We have presented Monte Carlo sim- 
ulations which prove the validity of the technique and 
demonstrate that it is a significant improvement over 
previous methods. 

Our highly accurate photometric redshifts, calibrated 
using hundreds of spectroscopic CFRS galaxies, have 
typical dispersions of only a/ (1 + z) < 0.06 to I^B = 24 
for z < 1.3. Our large field sizes (30') and multiple, 
widely separated lines of sight produce redshift distribu- 
tions far less affected by cosmic variance than previous 



surveys of similar depth. We compute Ja^B an< ^ ^-AB me ~ 
dian redshifts, as a function of limiting magnitude and in 
differential magnitude bins, with an error budget consist- 
ing of bootstrap resampled random errors, field-to-field 
variance and an empirical estimate of the systematic er- 
rors due to photometric redshift aliasing. We present 
our Ja^B and i?AB redshift distributions in tabular form 
and provide parametrized fits with errors estimated from 
bootstrap resampling. 

This work confirms that in the redshift regimes in 
which multicolor imaging surveys have appropriate wave- 
length coverage and depth, the errors in photometric red- 
shifts (conservatively Az/(l + z) < 0.1) are not the limit- 
ing factor in an accurate determination of N(z). In addi- 
tion, we have verified that the systematic errors inherent 
in the method are secondary to cosmic variance in the 
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Table 8. Parametrized Fits of _R A b CFDF-PRS Redshift Distributions 



Mag Range 






2 


<1.2 








z > 1.2 




N 




ZQ 


P 




A T 


z T 


7 


18.5 < Rab 


< 24.0 


13446 


(133) 


0.662 


(0.012) 


2.601 


(0.081) 


2380 


2.433 


3.083 


19.0 < Rab 


< 24.0 


13379 


(121) 


0.667 


(0.011) 


2.629 


(0.073) 


2415 


2.407 


3.034 


19.5 < Rab 


< 24.0 


13288 


(113) 


0.673 


(0.010) 


2.659 


(0.065) 


2425 


2.399 


3.025 


20.0 < Rab 


< 24.0 


13151 


(113) 


0.681 


(0.012) 


2.710 


(0.081) 


2447 


2.384 


2.999 


20.5 < Rab 


< 24.0 


12859 


(110) 


0.701 


(0.010) 


2.837 


(0.077) 


2471 


2.367 


2.969 


21.0 < Rab 


< 24.0 


12429 


(122) 


0.725 


(0.010) 


2.993 


(0.087) 


2507 


2.343 


2.927 


21.5 < Rab 


< 24.0 


11659 


(100) 


0.761 


(0.009) 


3.232 


(0.091) 


2549 


2.325 


2.895 


22.0 < Rab 


< 24.0 


10532 


(113) 


0.796 


(0.010) 


3.434 


(0.115) 


2591 


2.302 


2.881 


22.5 < Rab 


< 24.0 


8896 


(96) 


0.848 


(0.008) 


3.812 


(0.122) 


2624 


2.266 


2.839 


23.0 < Rab 


< 24.0 


6661 


(78) 


0.894 


(0.010) 


4.036 


(0.153) 


2532 


2.228 


2.770 



18.5 < Rab < 22.0 2992 

18.5 < Rab < 22.5 4607 

18.5 < Rab < 23.0 6800 

18.5 < Rab < 23.5 9632 

18.5 < Rab < 24.0 13446 

18.5 < Rab < 24.5 18656 



(170) 


0.538 


(0.011) 


3.140 


(150) 


0.567 


(0.010) 


3.116 


(150) 


0.602 


(0.011) 


3.007 


(145) 


0.634 


(0.013) 


2.825 


(133) 


0.662 


(0.012) 


2.601 


(146) 


0.665 


(0.012) 


2.309 



(0.111) 28 3.716 76.46 

(0.084) 70 3.670 45.30 

(0.083) 186 3.432 13.82 

(0.098) 612 2.982 5.643 

(0.081) 2380 2.433 3.083 

(0.060) 5930 2.337 2.778 



Note. — All quantities are denned as in Tabled For the -R^B sample we also provide a fit for the 18.5 < Rj^-q < 
24.5 magnitude range. 



overall error budget. Sparsely sampled spectroscopic or of effectively independent fields will be required to sig- 
wide-field photometric redshift surveys containing dozens nificantly improve upon the present results. 

APPENDIX 

CONVERGENCE PROPERTIES 

This Appendix presents some of the tests conducted, beyond the Monte Carlo simulations presented in Section fOl 
to determine the redshift and S/N ranges for which the new method converges. 

Formalism 

Generalizing Equation [21 to include a single prior, P(z), for a given sample 

\ - &{z s 

J^P(z)C i (z)dz' 



Af(z)=P(z)J2 r°o T^hz^Z > ( Al ) 



we see that the correct prior is the one for which N(z) = N[z) (i. e. we recover the correct underlying redshift 
distribution). This leads to the following integral equation for P(z): 

which can be solved by iteration. Note that the functional form of the ideal prior depends on the likelihood functions, 
as expected since at very low S/N (reflected in the likelihoods) the method should fail. Viewing the prior as expressing 
a change of coordinates to a space in which the galaxy distribution is flat, one might expect the prior to be similar 
to N(z). Therefore, we can use simulations to determine the redshift and S/N regimes in which taking P(z) = N(z) 
produces the correct answer (i.e. N{z) = N(z)). It is straightforward to show that this requires 



When this equation is satisfied, using N(z) as the prior is strictly correct. Although in general N(z) is not known a 
priori, assuming that N°(z) from Equation[5]is a reasonable approximation to N(z), one might still expect that 



(A4) 



is a valid convergence criterion. 

In the CFDF-PRS we set a conservati ve / -ba nd S/ N minimum of 1 0, abo ut I^B ~ 24, based on a direct calculation 
of the convergence criteria of Equations IA3I and IA4I (shown in Figure |A13(I . as well as the results of the Monte Carlo 
simulations described in the text. The redshift range over which the method is most reliable to this depth, apparent 



18 



B rod win et al. 




Fig. A13. — Convergence criteria r\ and rp of Equations IA3l and lA4l calculated in the Monte Carlo simulation of Section l4.3l The unshaded 
region between 0.2 < z < 1.3 is identified as the most trustworthy for recovering the underlying redshift distributions for magnitude limited 
samples down to 7ab ~ 24. Nevertheless, the redshift distribution can also be measured outside of this redshift range, though slightly less 
accurately, as shown in the next Section. The curves represent samples down to limiting magnitudes of /ab < 22.5 (directly above arrow), 
rising to /ad < 24 in increments of 0.25 magnitudes. The curves in the top panel are undefined where N(z) = for bright magnitude limits 
at high redshift. The smoothing effect of the photometric redshift errors prevents this occurrence in the lower panel. 



from Figure 1^,131 is 0.2 < z < 1.3, corresponding to values of r\ and rp within 10%-1 5% of unity. This redshift range 
is in very good agreement with the predictions for our filter set i|Brodwin et al.lll999|) . 

We are grateful to Chuck Steidel and Howard Yee for making their spectroscopic data available to us prior to 
publication, Chris Pritchet for observing some of our CFH12k data and Frank Valdes for assistance with MSCRED. 
MB would like to thank Ue-Li Pen, Henk Hoekstra, Barth Netterfield, and Peder Norberg for useful discussions. CP 
was supported by the Zwicky Prize fellowship program at ETH-Zurich. MB acknowledges support from the National 
Sciences and Engineering Research Council and the Ontario Graduate Scholarship program, as well as ETH-Zurich 
where much of this research was carried out. 
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